Edward Vytlacil
Event Study: Elon Musk Tweet
Audit Study: Neumark, Bank, and Van Nort (1996)
Elon Musk tweets image of a dog in a sweater on cover of a magazine at 5:47pm EST (10:47PM UTC) 1-28-2021
Tweet interpreted as supporting the cryptocurrency Dogecoin
Do markets react to tweet?
Example of an Event Study in finance.
This application is based on blog post “Causal effect of Elon Musk tweets on Dogecoin price” by Fabian Dablander.
Consider price of Dogecoin in two hour window before and after tweet.
Price of Dogecoin almost doubles.
Model for returns with no event:
“Constant Mean Return Model”, \[ R_{t} = \mu + \epsilon_{t}\] with \(\epsilon_t \sim N(0, \sigma_2)\), i.i.d. over \(t\).
Define
Take estimation window to be two hours before tweet, estimate \(\mu\) and \(\sigma^2\) using returns in estimation window.
Take event window to be two hours post-tweet.
Other choices for windows? Threats to validity?
Model for AR during estimation window, \(-120 \le t < 0\), \[AR_t \sim N(0, \sigma_2), ~~\mbox{i.i.d. } ~~ t,\]
Model for AR during event window, \(0\le t \le 120\), \[AR_t \sim N(\theta,\nu^2), ~~\mbox{i.i.d. } ~~ t,\] Consider: \[\mathbb{H}_0: \theta=0, \nu^2=\sigma^2, ~~ \mbox{vs} ~~\mathbb{H}_1: \theta \ne 0 ~\mbox{or}~ \nu^2 \ne \sigma^2\]
Steps for Hypothesis Testing:
# No. obs in event window
L <- nrow(subset(df.R,date>tweet.date))
# Cumulative AR in event window
CAR <- sum(subset(df.R,date>tweet.date)$AR)
# inference for CAR, at 0.05 level
test.stat <- CAR/sqrt(L*doge.var)
p.value <- 2* ( 1 - pnorm(abs(test.stat)))
test.stat[1] 6.220559
[1] 1.959964
[1] 0.0000000004953882
# equivalentlly, using mean
test.stat2 <- (CAR/L)/sqrt(doge.var/L)
p.value2 <- 2* ( 1 - pnorm(abs(test.stat2)))
test.stat2[1] 6.220559
[1] 0.0000000004953882
Threats to validity?
Could we follow same procedure with daily data?
What if we only had one observation in event window?
What if we didn’t assume returns are normally distributed?
Event Study: Elon Musk Tweet
Audit Study: Neumark, Bank, and Van Nort (1996)
“Two individuals (auditors or testers) are matched for all relevant personal characteristics other than the one that is presumed to lead to discrimination, e.g., rate, ethnicity, gender. They then apply for a job, a housing unit, or a mortgage, or begin to negotiate for a good or service. The results they achieve and the treatment they receive in the transaction are carefully observed, documented, and analyzed to determine if the outcomes reveal patterns of differential treatment on the basis of the trait studied and/or protected by anti-discrimination laws. . . ’’ Fix and Struyk (1993) link
Use pairs of actors, one minority and one non-minority, with similar relevant attributes.
Send a pair of actors to apply for each job (or loan, etc)
See if differences in outcomes for minority vs non-minority actors.
Neumark, Bank, and Van Nort (1996):
Sex Discrimination in Restaurant Hiring: An Audit Study
Investigates sex-discrimination in restaurant hiring of waiters.
Selected 65 restaurants in Philadelphia, classified as high-, medium, or low-price.
Four auditors, two man and two woman college students.
Neumark, Bank, and Van Nort (1996):
Sex Discrimination in Restaurant Hiring: An Audit Study
Neumark, Bank, and Van Nort (1996):
Sex Discrimination in Restaurant Hiring: An Audit Study
| Audit of Restaurants (full sample) | ||||||
| Offer? | High-Price (n= 46 ) | Med-Price (n= 42 ) | Low-Price (n= 42 ) | |||
|---|---|---|---|---|---|---|
| No. | Frac | No. | Frac | No. | Frac | |
| Neither | 22 | 0.48 | 18 | 0.43 | 26 | 0.62 |
| Both | 2 | 0.04 | 8 | 0.19 | 4 | 0.10 |
| Only Man | 20 | 0.43 | 12 | 0.29 | 0 | 0.00 |
| Only Woman | 2 | 0.04 | 4 | 0.10 | 12 | 0.29 |
| Source: Neumark, Bank, and Van Nort (1996). | ||||||
How to conduct inference?
Steps for Hypothesis Testing:
Consider Paired Sign Test:
State \(\mathbb{H}_0\) and \(\mathbb{H}_1\) hypotheses;
Condition on only man or only woman outcomes.
Use binomial test on conditional sample.
Exact binomial test
data: 20 and 22
number of successes = 20, number of trials = 22, p-value = 0.0001211
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.7083873 0.9887944
sample estimates:
probability of success
0.9090909
Exact binomial test
data: 12 and 16
number of successes = 12, number of trials = 16, p-value = 0.07681
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.4762292 0.9273380
sample estimates:
probability of success
0.75
Exact binomial test
data: 0 and 12
number of successes = 0, number of trials = 12, p-value = 0.0004883
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.0000000 0.2646485
sample estimates:
probability of success
0
| Audit of Restaurants (conditional sample) | ||||||
| Offer? | High-Price (n= 22 ) | Med-Price (n= 16 ) | Low-Price (n= 12 ) | |||
|---|---|---|---|---|---|---|
| No. | Frac | No. | Frac | No. | Frac | |
| Only Man | 20 | 0.91 | 12 | 0.75 | 0 | 0 |
| Only Woman | 2 | 0.09 | 4 | 0.25 | 12 | 1 |
| p-value symmetry | 0 | 0.077 | 0 | |||
| Source: Neumark, Bank, and Van Nort (1996). | ||||||
Very different results for the two woman auditors.
They suspect difference do to one woman auditor being Asian-American.
Also consider analysis dropping the audits involving the Asian-American auditor.
| Audit of Restaurants (Dropping Woman 2) | ||||||
| Offer? | High-Price (n= 22 ) | Med-Price (n= 22 ) | Low-Price (n= 12 ) | |||
|---|---|---|---|---|---|---|
| No. | Frac | No. | Frac | No. | Frac | |
| Neither | 10 | 0.45 | 8 | 0.36 | 9 | 0.75 |
| Both | 2 | 0.09 | 6 | 0.27 | 1 | 0.08 |
| Only Man | 10 | 0.45 | 4 | 0.18 | 0 | 0.00 |
| Only Woman | 0 | 0.00 | 4 | 0.18 | 2 | 0.17 |
| Source: Neumark, Bank, and Van Nort (1996). | ||||||
How to conduct inference?
Exact binomial test
data: 10 and 10
number of successes = 10, number of trials = 10, p-value = 0.001953
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.6915029 1.0000000
sample estimates:
probability of success
1
Exact binomial test
data: 4 and 8
number of successes = 4, number of trials = 8, p-value = 1
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.1570128 0.8429872
sample estimates:
probability of success
0.5
Exact binomial test
data: 0 and 2
number of successes = 0, number of trials = 2, p-value = 0.5
alternative hypothesis: true probability of success is not equal to 0.5
95 percent confidence interval:
0.0000000 0.8418861
sample estimates:
probability of success
0
| Audit of Restaurants (conditional sample) | ||||||
| Offer? | High-Price (n= 10 ) | Med-Price (n= 8 ) | Low-Price (n= 2 ) | |||
|---|---|---|---|---|---|---|
| No. | Frac | No. | Frac | No. | Frac | |
| Only Man | 10 | 1 | 4 | 0.5 | 0 | 0 |
| Only Woman | 0 | 0 | 4 | 0.5 | 2 | 1 |
| p-value symmetry | 0.002 | 1 | 0.5 | |||
| Source: Neumark, Bank, and Van Nort (1996). | ||||||
Is not rejecting the null evidence that the null is true?
Including both woman auditors, we reject null for low-price restaurants (p-value: 0.00049)
Including only non-Asian-American auditor, we can’t reject null (p-value: 0.5)
Important to consider power of test (see Handout 3).
Can be hard to make each pair of actors similar in relevant attributes, and attempting to do so can exacerbate effect of those attributes that are not similar (Heckman (1998) link).
Not double-blind. Behavior/belief of actors may lead to differences in outcomes.
Expensive, resulting in small sample sizes.
Not appropriate for all situations – for example, use of police force, criminal sentencing decisions. .. .
More recent “Correspondence methodology” alleviates first three issues, while exacerbating fourth.
Econ 123: Intermediate Econometrics and Data Analysis